Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 86
Filtrar
2.
Hum Genet ; 141(11): 1739-1748, 2022 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-35226188

RESUMEN

Uterine fibroids (UF) are common pelvic tumors in women, heritable, and genome-wide association studies (GWAS) have identified ~ 30 loci associated with increased risk in UF. Using summary statistics from a previously published UF GWAS performed in a non-Hispanic European Ancestry (NHW) female subset from the Electronic Medical Records and Genomics (eMERGE) Network, we constructed a polygenic risk score (PRS) for UF. UF-PRS was developed using PRSice and optimized in the separate clinical population of BioVU. PRS was validated using parallel methods of 10-fold cross-validation logistic regression and phenome-wide association study (PheWAS) in a seperate subset of eMERGE NHW females (validation set), excluding samples used in GWAS. PRSice determined pt < 0.001 and after linkage disequilibrium pruning (r2 < 0.2), 4458 variants were in the PRS which was significant (pseudo-R2 = 0.0018, p = 0.041). 10-fold cross-validation logistic regression modeling of validation set revealed the model had an area under the curve (AUC) value of 0.60 (95% confidence interval [CI] 0.58-0.62) when plotted in a receiver operator curve (ROC). PheWAS identified six phecodes associated with the PRS with the most significant phenotypes being 218 'benign neoplasm of uterus' and 218.1 'uterine leiomyoma' (p = 1.94 × 10-23, OR 1.31 [95% CI 1.26-1.37] and p = 3.50 × 10-23, OR 1.32 [95% CI 1.26-1.37]). We have developed and validated the first PRS for UF. We find our PRS has predictive ability for UF and captures genetic architecture of increased risk for UF that can be used in further studies.


Asunto(s)
Estudio de Asociación del Genoma Completo , Leiomioma , Femenino , Predisposición Genética a la Enfermedad , Genómica , Humanos , Leiomioma/genética , Desequilibrio de Ligamiento , Factores de Riesgo
3.
Nature ; 600(7890): 675-679, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-34887591

RESUMEN

Increased blood lipid levels are heritable risk factors of cardiovascular disease with varied prevalence worldwide owing to different dietary patterns and medication use1. Despite advances in prevention and treatment, in particular through reducing low-density lipoprotein cholesterol levels2, heart disease remains the leading cause of death worldwide3. Genome-wideassociation studies (GWAS) of blood lipid levels have led to important biological and clinical insights, as well as new drug targets, for cardiovascular disease. However, most previous GWAS4-23 have been conducted in European ancestry populations and may have missed genetic variants that contribute to lipid-level variation in other ancestry groups. These include differences in allele frequencies, effect sizes and linkage-disequilibrium patterns24. Here we conduct a multi-ancestry, genome-wide genetic discovery meta-analysis of lipid levels in approximately 1.65 million individuals, including 350,000 of non-European ancestries. We quantify the gain in studying non-European ancestries and provide evidence to support the expansion of recruitment of additional ancestries, even with relatively small sample sizes. We find that increasing diversity rather than studying additional individuals of European ancestry results in substantial improvements in fine-mapping functional variants and portability of polygenic prediction (evaluated in approximately 295,000 individuals from 7 ancestry groupings). Modest gains in the number of discovered loci and ancestry-specific variants were also achieved. As GWAS expand emphasis beyond the identification of genes and fundamental biology towards the use of genetic variants for preventive and precision medicine25, we anticipate that increased diversity of participants will lead to more accurate and equitable26 application of polygenic scores in clinical practice.


Asunto(s)
Enfermedades Cardiovasculares , Estudio de Asociación del Genoma Completo , Enfermedades Cardiovasculares/genética , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo/métodos , Humanos , Desequilibrio de Ligamiento , Herencia Multifactorial , Polimorfismo de Nucleótido Simple/genética , Grupos de Población
4.
PLoS Genet ; 17(6): e1009534, 2021 06.
Artículo en Inglés | MEDLINE | ID: mdl-34086673

RESUMEN

Assumptions are made about the genetic model of single nucleotide polymorphisms (SNPs) when choosing a traditional genetic encoding: additive, dominant, and recessive. Furthermore, SNPs across the genome are unlikely to demonstrate identical genetic models. However, running SNP-SNP interaction analyses with every combination of encodings raises the multiple testing burden. Here, we present a novel and flexible encoding for genetic interactions, the elastic data-driven genetic encoding (EDGE), in which SNPs are assigned a heterozygous value based on the genetic model they demonstrate in a dataset prior to interaction testing. We assessed the power of EDGE to detect genetic interactions using 29 combinations of simulated genetic models and found it outperformed the traditional encoding methods across 10%, 30%, and 50% minor allele frequencies (MAFs). Further, EDGE maintained a low false-positive rate, while additive and dominant encodings demonstrated inflation. We evaluated EDGE and the traditional encodings with genetic data from the Electronic Medical Records and Genomics (eMERGE) Network for five phenotypes: age-related macular degeneration (AMD), age-related cataract, glaucoma, type 2 diabetes (T2D), and resistant hypertension. A multi-encoding genome-wide association study (GWAS) for each phenotype was performed using the traditional encodings, and the top results of the multi-encoding GWAS were considered for SNP-SNP interaction using the traditional encodings and EDGE. EDGE identified a novel SNP-SNP interaction for age-related cataract that no other method identified: rs7787286 (MAF: 0.041; intergenic region of chromosome 7)-rs4695885 (MAF: 0.34; intergenic region of chromosome 4) with a Bonferroni LRT p of 0.018. A SNP-SNP interaction was found in data from the UK Biobank within 25 kb of these SNPs using the recessive encoding: rs60374751 (MAF: 0.030) and rs6843594 (MAF: 0.34) (Bonferroni LRT p: 0.026). We recommend using EDGE to flexibly detect interactions between SNPs exhibiting diverse action.


Asunto(s)
Modelos Genéticos , Catarata/genética , Conjuntos de Datos como Asunto , Diabetes Mellitus Tipo 2/genética , Frecuencia de los Genes , Estudio de Asociación del Genoma Completo , Glaucoma/genética , Humanos , Hipertensión/genética , Degeneración Macular/genética , Fenotipo , Polimorfismo de Nucleótido Simple
5.
Genet Epidemiol ; 45(1): 4-15, 2021 02.
Artículo en Inglés | MEDLINE | ID: mdl-32964493

RESUMEN

Carotid artery atherosclerotic disease (CAAD) is a risk factor for stroke. We used a genome-wide association (GWAS) approach to discover genetic variants associated with CAAD in participants in the electronic Medical Records and Genomics (eMERGE) Network. We identified adult CAAD cases with unilateral or bilateral carotid artery stenosis and controls without evidence of stenosis from electronic health records at eight eMERGE sites. We performed GWAS with a model adjusting for age, sex, study site, and genetic principal components of ancestry. In eMERGE we found 1793 CAAD cases and 17,958 controls. Two loci reached genome-wide significance, on chr6 in LPA (rs10455872, odds ratio [OR] (95% confidence interval [CI]) = 1.50 (1.30-1.73), p = 2.1 × 10-8 ) and on chr7, an intergenic single nucleotide variant (SNV; rs6952610, OR (95% CI) = 1.25 (1.16-1.36), p = 4.3 × 10-8 ). The chr7 association remained significant in the presence of the LPA SNV as a covariate. The LPA SNV was also associated with coronary heart disease (CHD; 4199 cases and 11,679 controls) in this study (OR (95% CI) = 1.27 (1.13-1.43), p = 5 × 10-5 ) but the chr7 SNV was not (OR (95% CI) = 1.03 (0.97-1.09), p = .37). Both variants replicated in UK Biobank. Elevated lipoprotein(a) concentrations ([Lp(a)]) and LPA variants associated with elevated [Lp(a)] have previously been associated with CAAD and CHD, including rs10455872. With electronic health record phenotypes in eMERGE and UKB, we replicated a previously known association and identified a novel locus associated with CAAD.


Asunto(s)
Estenosis Carotídea , Estudio de Asociación del Genoma Completo , Registros Electrónicos de Salud , Predisposición Genética a la Enfermedad , Genómica , Humanos , Lipoproteína(a)/genética , Modelos Genéticos , Polimorfismo de Nucleótido Simple
6.
BMC Med Genomics ; 13(1): 105, 2020 07 25.
Artículo en Inglés | MEDLINE | ID: mdl-32711518

RESUMEN

BACKGROUND: Obstructive sleep apnea (OSA) is defined by frequent episodes of reduced or complete cessation of airflow during sleep and is linked to negative health outcomes. Understanding the genetic factors influencing expression of OSA may lead to new treatment strategies. Electronic health records (EHRs) can be leveraged to both validate previously reported OSA-associated genomic variation and detect novel relationships between these variants and comorbidities. METHODS: We identified candidate single nucleotide polymorphisms (SNPs) via systematic literature review of existing research. Using datasets available at Geisinger (n = 39,407) and Vanderbilt University Medical Center (n = 24,084), we evaluated associations between 40 previously implicated SNPs and OSA diagnosis, defined using clinical codes. We also evaluated associations between these SNPs and OSA severity measures obtained from sleep reports at Geisinger (n = 6571). Finally, we used a phenome-wide association study approach to help reveal pleiotropic genetic effects between OSA candidate SNPs and other clinical codes and laboratory values available in the EHR. RESULTS: Most previously reported OSA candidate SNPs showed minimal to no evidence for associations with OSA diagnosis or severity in the EHR-derived datasets. Three SNPs in LEPR, MMP-9, and GABBR1 validated for an association with OSA diagnosis in European Americans; the SNP in GABBR1 was associated following meta-analysis of results from both clinical populations. The GABBR1 and LEPR SNPs, and one additional SNP, were associated with OSA severity measures in European Americans from Geisinger. Three additional candidate OSA SNPs were not associated with OSA-related traits but instead with hyperlipidemia and autoimmune diseases of the thyroid. CONCLUSIONS: To our knowledge, this is one of the largest candidate gene studies and one of the first phenome-wide association studies of OSA genomic variation. Results validate genetic associates with OSA in the LEPR, MMP-9 and GABBR1 genes, but suggest that the majority of previously identified genetic associations with OSA may be false positives. Phenome-wide analyses provide evidence of mediated pleiotropy. Future well-powered genome-wide association analyses of OSA risk and severity across populations with diverse ancestral backgrounds are needed. The comprehensive nature of the analyses represents a platform for informing future work focused on understanding how genetic data can be useful to informing treatment of OSA and related comorbidities.


Asunto(s)
Registros Electrónicos de Salud , Etnicidad/genética , Predisposición Genética a la Enfermedad , Polimorfismo de Nucleótido Simple , Apnea Obstructiva del Sueño/genética , Apnea Obstructiva del Sueño/patología , Estudios de Casos y Controles , Femenino , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Fenotipo
7.
Am J Obstet Gynecol ; 223(4): 559.e1-559.e21, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-32289280

RESUMEN

BACKGROUND: Polycystic ovary syndrome is the most common endocrine disorder affecting women of reproductive age. A number of criteria have been developed for clinical diagnosis of polycystic ovary syndrome, with the Rotterdam criteria being the most inclusive. Evidence suggests that polycystic ovary syndrome is significantly heritable, and previous studies have identified genetic variants associated with polycystic ovary syndrome diagnosed using different criteria. The widely adopted electronic health record system provides an opportunity to identify patients with polycystic ovary syndrome using the Rotterdam criteria for genetic studies. OBJECTIVE: To identify novel associated genetic variants under the same phenotype definition, we extracted polycystic ovary syndrome cases and unaffected controls based on the Rotterdam criteria from the electronic health records and performed a discovery-validation genome-wide association study. STUDY DESIGN: We developed a polycystic ovary syndrome phenotyping algorithm on the basis of the Rotterdam criteria and applied it to 3 electronic health record-linked biobanks to identify cases and controls for genetic study. In the discovery phase, we performed an individual genome-wide association study using the Geisinger MyCode and the Electronic Medical Records and Genomics cohorts, which were then meta-analyzed. We attempted validation of the significant association loci (P<1×10-6) in the BioVU cohort. All association analyses used logistic regression, assuming an additive genetic model, and adjusted for principal components to control for population stratification. An inverse-variance fixed-effect model was adopted for meta-analysis. In addition, we examined the top variants to evaluate their associations with each criterion in the phenotyping algorithm. We used the STRING database to characterize protein-protein interaction network. RESULTS: Using the same algorithm based on the Rotterdam criteria, we identified 2995 patients with polycystic ovary syndrome and 53,599 population controls in total (2742 cases and 51,438 controls from the discovery phase; 253 cases and 2161 controls in the validation phase). We identified 1 novel genome-wide significant variant rs17186366 (odds ratio [OR]=1.37 [1.23, 1.54], P=2.8×10-8) located near SOD2. In addition, 2 loci with suggestive association were also identified: rs113168128 (OR=1.72 [1.42, 2.10], P=5.2×10-8), an intronic variant of ERBB4 that is independent from the previously published variants, and rs144248326 (OR=2.13 [1.52, 2.86], P=8.45×10-7), a novel intronic variant in WWTR1. In the further association tests of the top 3 single-nucleotide polymorphisms with each criterion in the polycystic ovary syndrome algorithm, we found that rs17186366 (SOD2) was associated with polycystic ovaries and hyperandrogenism, whereas rs11316812 (ERBB4) and rs144248326 (WWTR1) were mainly associated with oligomenorrhea or infertility. We also validated the previously reported association with DENND1A1. Using the STRING database to characterize protein-protein interactions, we found both ERBB4 and WWTR1 can interact with YAP1, which has been previously associated with polycystic ovary syndrome. CONCLUSION: Through a discovery-validation genome-wide association study on polycystic ovary syndrome identified from electronic health records using an algorithm based on Rotterdam criteria, we identified and validated a novel genome-wide significant association with a variant near SOD2. We also identified a novel independent variant within ERBB4 and a suggestive association with WWTR1. With previously identified polycystic ovary syndrome gene YAP1, the ERBB4-YAP1-WWTR1 network suggests involvement of the epidermal growth factor receptor and the Hippo pathway in the multifactorial etiology of polycystic ovary syndrome.


Asunto(s)
Síndrome del Ovario Poliquístico/genética , Receptor ErbB-4/genética , Transactivadores/genética , Proteínas Adaptadoras Transductoras de Señales/metabolismo , Adulto , Estudios de Casos y Controles , Registros Electrónicos de Salud , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Hiperandrogenismo/genética , Infertilidad Femenina/genética , Persona de Mediana Edad , Oligomenorrea/genética , Quistes Ováricos/genética , Síndrome del Ovario Poliquístico/diagnóstico , Síndrome del Ovario Poliquístico/fisiopatología , Polimorfismo de Nucleótido Simple , Superóxido Dismutasa/genética , Factores de Transcripción/metabolismo , Proteínas Coactivadoras Transcripcionales con Motivo de Unión a PDZ , Proteínas Señalizadoras YAP
8.
BMC Med ; 17(1): 168, 2019 08 28.
Artículo en Inglés | MEDLINE | ID: mdl-31455332

RESUMEN

BACKGROUND: The alpha-adrenergic agonist phenylephrine is often used to treat hypotension during anesthesia. In clinical situations, low blood pressure may require prompt intervention by intravenous bolus or infusion. Differences in responsiveness to phenylephrine treatment are commonly observed in clinical practice. Candidate gene studies indicate genetic variants may contribute to this variable response. METHODS: Pharmacological and physiological data were retrospectively extracted from routine clinical anesthetic records. Response to phenylephrine boluses could not be reliably assessed, so infusion rates were used for analysis. Unsupervised k-means clustering was conducted on clean data containing 4130 patients based on phenylephrine infusion rate and blood pressure parameters, to identify potential phenotypic subtypes. Genome-wide association studies (GWAS) were performed against average infusion rates in two cohorts: phase I (n = 1205) and phase II (n = 329). Top genetic variants identified from the meta-analysis were further examined to see if they could differentiate subgroups identified by k-means clustering. RESULTS: Three subgroups of patients with different response to phenylephrine were clustered and characterized: resistant (high infusion rate yet low mean systolic blood pressure (SBP)), intermediate (low infusion rate and low SBP), and sensitive (low infusion rate with high SBP). Differences among clusters were tabulated to assess for possible confounding influences. Comorbidity hierarchical clustering showed the resistant group had a higher prevalence of confounding factors than the intermediate and sensitive groups although overall prevalence is below 6%. Three loci with P < 1 × 10-6 were associated with phenylephrine infusion rate. Only rs11572377 with P = 6.09 × 10-7, a 3'UTR variant of EDN2, encoding a secretory vasoconstricting peptide, could significantly differentiate resistant from sensitive groups (P = 0.015 and 0.018 for phase I and phase II) or resistant from pooled sensitive and intermediate groups (P = 0.047 and 0.018). CONCLUSIONS: Retrospective analysis of electronic anesthetic records data coupled with the genetic data identified genetic variants contributing to variable sensitivity to phenylephrine infusion during anesthesia. Although the identified top gene, EDN2, has robust biological relevance to vasoconstriction by binding to endothelin type A (ETA) receptors on arterial smooth muscle cells, further functional as well as replication studies are necessary to confirm this association.


Asunto(s)
Agonistas de Receptores Adrenérgicos alfa 1/administración & dosificación , Anestesia/efectos adversos , Hipotensión/inducido químicamente , Hipotensión/genética , Fenilefrina/administración & dosificación , Adulto , Presión Sanguínea/efectos de los fármacos , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Infusiones Intravenosas , Embarazo , Estudios Retrospectivos
9.
Front Genet ; 10: 511, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-31249589

RESUMEN

Uterine fibroids affect up to 77% of women by menopause and account for up to $34 billion in healthcare costs each year. Although fibroid risk is heritable, genetic risk for fibroids is not well understood. We conducted a two-stage case-control meta-analysis of genetic variants in European and African ancestry women with and without fibroids classified by a previously published algorithm requiring pelvic imaging or confirmed diagnosis. Women from seven electronic Medical Records and Genomics (eMERGE) network sites (3,704 imaging-confirmed cases and 5,591 imaging-confirmed controls) and women of African and European ancestry from UK Biobank (UKB, 5,772 cases and 61,457 controls) were included in the discovery genome-wide association study (GWAS) meta-analysis. Variants showing evidence of association in Stage I GWAS (P < 1 × 10-5) were targeted in an independent replication sample of African and European ancestry individuals from the UKB (Stage II) (12,358 cases and 138,477 controls). Logistic regression models were fit with genetic markers imputed to a 1000 Genomes reference and adjusted for principal components for each race- and site-specific dataset, followed by fixed-effects meta-analysis. Final analysis with 21,804 cases and 205,525 controls identified 326 genome-wide significant variants in 11 loci, with three novel loci at chromosome 1q24 (sentinel-SNP rs14361789; P = 4.7 × 10-8), chromosome 16q12.1 (sentinel-SNP rs4785384; P = 1.5 × 10-9) and chromosome 20q13.1 (sentinel-SNP rs6094982; P = 2.6 × 10-8). Our statistically significant findings further support previously reported loci including SNPs near WT1, TNRC6B, SYNE1, BET1L, and CDC42/WNT4. We report evidence of ancestry-specific findings for sentinel-SNP rs10917151 in the CDC42/WNT4 locus (P = 1.76 × 10-24). Ancestry-specific effect-estimates for rs10917151 were in opposite directions (P-Het-between-groups = 0.04) for predominantly African (OR = 0.84) and predominantly European women (OR = 1.16). Genetically-predicted gene expression of several genes including LUZP1 in vagina (P = 4.6 × 10-8), OBFC1 in esophageal mucosa (P = 8.7 × 10-8), NUDT13 in multiple tissues including subcutaneous adipose tissue (P = 3.3 × 10-6), and HEATR3 in skeletal muscle tissue (P = 5.8 × 10-6) were associated with fibroids. The finding for HEATR3 was supported by SNP-based summary Mendelian randomization analysis. Our study suggests that fibroid risk variants act through regulatory mechanisms affecting gene expression and are comprised of alleles that are both ancestry-specific and shared across continental ancestries.

10.
Pac Symp Biocomput ; 24: 272-283, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-30864329

RESUMEN

The link between cardiovascular diseases and neurological disorders has been widely observed in the aging population. Disease prevention and treatment rely on understanding the potential genetic nexus of multiple diseases in these categories. In this study, we were interested in detecting pleiotropy, or the phenomenon in which a genetic variant influences more than one phenotype. Marker-phenotype association approaches can be grouped into univariate, bivariate, and multivariate categories based on the number of phenotypes considered at one time. Here we applied one statistical method per category followed by an eQTL colocalization analysis to identify potential pleiotropic variants that contribute to the link between cardiovascular and neurological diseases. We performed our analyses on ~530,000 common SNPs coupled with 65 electronic health record (EHR)-based phenotypes in 43,870 unrelated European adults from the Electronic Medical Records and Genomics (eMERGE) network. There were 31 variants identified by all three methods that showed significant associations across late onset cardiac- and neurologic- diseases. We further investigated functional implications of gene expression on the detected "lead SNPs" via colocalization analysis, providing a deeper understanding of the discovered associations. In summary, we present the framework and landscape for detecting potential pleiotropy using univariate, bivariate, multivariate, and colocalization methods. Further exploration of these potentially pleiotropic genetic variants will work toward understanding disease causing mechanisms across cardiovascular and neurological diseases and may assist in considering disease prevention as well as drug repositioning in future research.


Asunto(s)
Enfermedades Cardiovasculares/genética , Pleiotropía Genética , Enfermedades del Sistema Nervioso/genética , Adulto , Anciano , Biología Computacional , Registros Electrónicos de Salud , Femenino , Estudios de Asociación Genética , Predisposición Genética a la Enfermedad , Humanos , Masculino , Persona de Mediana Edad , Análisis Multivariante , Fenotipo , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo
11.
NPJ Genom Med ; 4: 3, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-30774981

RESUMEN

We conducted an electronic health record (EHR)-based phenome-wide association study (PheWAS) to discover pleiotropic effects of variants in three lipoprotein metabolism genes PCSK9, APOB, and LDLR. Using high-density genotype data, we tested the associations of variants in the three genes with 1232 EHR-derived binary phecodes in 51,700 European-ancestry (EA) individuals and 585 phecodes in 10,276 African-ancestry (AA) individuals; 457 PCSK9, 730 APOB, and 720 LDLR variants were filtered by imputation quality (r 2 > 0.4), minor allele frequency (>1%), linkage disequilibrium (r 2 < 0.3), and association with LDL-C levels, yielding a set of two PCSK9, three APOB, and five LDLR variants in EA but no variants in AA. Cases and controls were defined for each phecode using the PheWAS package in R. Logistic regression assuming an additive genetic model was used with adjustment for age, sex, and the first two principal components. Significant associations were tested in additional cohorts from Vanderbilt University (n = 29,713), the Marshfield Clinic Personalized Medicine Research Project (n = 9562), and UK Biobank (n = 408,455). We identified one PCSK9, two APOB, and two LDLR variants significantly associated with an examined phecode. Only one of the variants was associated with a non-lipid disease phecode, ("myopia") but this association was not significant in the replication cohorts. In this large-scale PheWAS we did not find LDL-C-related variants in PCSK9, APOB, and LDLR to be associated with non-lipid-related phenotypes including diabetes, neurocognitive disorders, or cataracts.

12.
Am J Hum Genet ; 104(1): 55-64, 2019 01 03.
Artículo en Inglés | MEDLINE | ID: mdl-30598166

RESUMEN

Phenome-wide association studies (PheWASs) have been a useful tool for testing associations between genetic variations and multiple complex traits or diagnoses. Linking PheWAS-based associations between phenotypes and a variant or a genomic region into a network provides a new way to investigate cross-phenotype associations, and it might broaden the understanding of genetic architecture that exists between diagnoses, genes, and pleiotropy. We created a network of associations from one of the largest PheWASs on electronic health record (EHR)-derived phenotypes across 38,682 unrelated samples from the Geisinger's biobank; the samples were genotyped through the DiscovEHR project. We computed associations between 632,574 common variants and 541 diagnosis codes. Using these associations, we constructed a "disease-disease" network (DDN) wherein pairs of diseases were connected on the basis of shared associations with a given genetic variant. The DDN provides a landscape of intra-connections within the same disease classes, as well as inter-connections across disease classes. We identified clusters of diseases with known biological connections, such as autoimmune disorders (type 1 diabetes, rheumatoid arthritis, and multiple sclerosis) and cardiovascular disorders. Previously unreported relationships between multiple diseases were identified on the basis of genetic associations as well. The network approach applied in this study can be used to uncover interactions between diseases as a result of their shared, potentially pleiotropic SNPs. Additionally, this approach might advance clinical research and even clinical practice by accelerating our understanding of disease mechanisms on the basis of similar underlying genetic associations.


Asunto(s)
Enfermedad/genética , Registros Electrónicos de Salud , Estudios de Asociación Genética , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Enfermedades Autoinmunes/genética , Enfermedades Cardiovasculares/genética , Epigenómica , Humanos
13.
BMC Bioinformatics ; 20(1): 46, 2019 Jan 22.
Artículo en Inglés | MEDLINE | ID: mdl-30669967

RESUMEN

BACKGROUND: The development of sequencing techniques and statistical methods provides great opportunities for identifying the impact of rare genetic variation on complex traits. However, there is a lack of knowledge on the impact of sample size, case numbers, the balance of cases vs controls for both burden and dispersion based rare variant association methods. For example, Phenome-Wide Association Studies may have a wide range of case and control sample sizes across hundreds of diagnoses and traits, and with the application of statistical methods to rare variants, it is important to understand the strengths and limitations of the analyses. RESULTS: We conducted a large-scale simulation of randomly selected low-frequency protein-coding regions using twelve different balanced samples with an equal number of cases and controls as well as twenty-one unbalanced sample scenarios. We further explored statistical performance of different minor allele frequency thresholds and a range of genetic effect sizes. Our simulation results demonstrate that using an unbalanced study design has an overall higher type I error rate for both burden and dispersion tests compared with a balanced study design. Regression has an overall higher type I error with balanced cases and controls, while SKAT has higher type I error for unbalanced case-control scenarios. We also found that both type I error and power were driven by the number of cases in addition to the case to control ratio under large control group scenarios. Based on our power simulations, we observed that a SKAT analysis with case numbers larger than 200 for unbalanced case-control models yielded over 90% power with relatively well controlled type I error. To achieve similar power in regression, over 500 cases are needed. Moreover, SKAT showed higher power to detect associations in unbalanced case-control scenarios than regression. CONCLUSIONS: Our results provide important insights into rare variant association study designs by providing a landscape of type I error and statistical power for a wide range of sample sizes. These results can serve as a benchmark for making decisions about study design for rare variant analyses.


Asunto(s)
Simulación por Computador/normas , Estudios de Asociación Genética/métodos , Tamaño de la Muestra , Humanos , Modelos Genéticos , Proyectos de Investigación
14.
Mol Psychiatry ; 24(5): 626-632, 2019 05.
Artículo en Inglés | MEDLINE | ID: mdl-30617273

RESUMEN

With the urgency to treat patients more effectively for opioid use disorder in the midst of the opioid epidemic, a key area for precision medicine is to improve individualized medication-assisted treatment for opioid use disorder. The expansion of medication-assisted treatment is a key to reducing illicit opioid use, preventing opioid overdose deaths, and reducing the comorbidities and societal impacts of opioid use disorder. The most common medication for opioid use disorder will soon be buprenorphine. Research to date shows the successful impact of buprenorphine treatment, including the pharmacogenomics of buprenorphine response and treatment efficacy. Buprenorphine is also a promising treatment for depression and anxiety, and neonatal opioid withdrawal syndrome (NOWS). However, the rates of success with medication-assisted treatment for opioid use disorder, particularly at the beginning of treatment, still show many individuals relapsing to illicit opioid use. With the scope of the opioid crisis, there is an urgent need for expansion of buprenorphine treatment research to provide critical information for improving outcomes of opioid use disorder. Implementing the best strategies for opioid use disorder treatment is of dire urgency and will save lives.


Asunto(s)
Buprenorfina/farmacología , Buprenorfina/uso terapéutico , Trastornos Relacionados con Opioides/tratamiento farmacológico , Analgésicos Opioides/uso terapéutico , Humanos , Antagonistas de Narcóticos , Epidemia de Opioides/tendencias , Síndrome de Abstinencia a Sustancias/tratamiento farmacológico , Resultado del Tratamiento
15.
Genes Immun ; 20(7): 555-565, 2019 09.
Artículo en Inglés | MEDLINE | ID: mdl-30459343

RESUMEN

Resting-state white blood cell (WBC) count is a marker of inflammation and immune system health. There is evidence that WBC count is not fixed over time and there is heterogeneity in WBC trajectory that is associated with morbidity and mortality. Latent class mixed modeling (LCMM) is a method that can identify unobserved heterogeneity in longitudinal data and attempts to classify individuals into groups based on a linear model of repeated measurements. We applied LCMM to repeated WBC count measures derived from electronic medical records of participants of the National Human Genetics Research Institute (NHRGI) electronic MEdical Record and GEnomics (eMERGE) network study, revealing two WBC count trajectory phenotypes. Advancing these phenotypes to GWAS, we found genetic associations between trajectory class membership and regions on chromosome 1p34.3 and chromosome 11q13.4. The chromosome 1 region contains CSF3R, which encodes the granulocyte colony-stimulating factor receptor. This protein is a major factor in neutrophil stimulation and proliferation. The association on chromosome 11 contain genes RNF169 and XRRA1; both involved in the regulation of double-strand break DNA repair.


Asunto(s)
Recuento de Leucocitos/métodos , Leucocitos/clasificación , Adulto , Anciano , Bases de Datos Genéticas , Registros Electrónicos de Salud , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Análisis de Clases Latentes , Masculino , Persona de Mediana Edad , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Proteínas/genética , Receptores del Factor Estimulante de Colonias/genética , Ubiquitina-Proteína Ligasas/genética
16.
Genet Epidemiol ; 43(1): 63-81, 2019 02.
Artículo en Inglés | MEDLINE | ID: mdl-30298529

RESUMEN

The Electronic Medical Records and Genomics (eMERGE) network is a network of medical centers with electronic medical records linked to existing biorepository samples for genomic discovery and genomic medicine research. The network sought to unify the genetic results from 78 Illumina and Affymetrix genotype array batches from 12 contributing medical centers for joint association analysis of 83,717 human participants. In this report, we describe the imputation of eMERGE results and methods to create the unified imputed merged set of genome-wide variant genotype data. We imputed the data using the Michigan Imputation Server, which provides a missing single-nucleotide variant genotype imputation service using the minimac3 imputation algorithm with the Haplotype Reference Consortium genotype reference set. We describe the quality control and filtering steps used in the generation of this data set and suggest generalizable quality thresholds for imputation and phenotype association studies. To test the merged imputed genotype set, we replicated a previously reported chromosome 6 HLA-B herpes zoster (shingles) association and discovered a novel zoster-associated loci in an epigenetic binding site near the terminus of chromosome 3 (3p29).


Asunto(s)
Registros Electrónicos de Salud , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Herpes Zóster/genética , Algoritmos , Población Negra/genética , Cromosomas Humanos/genética , Femenino , Haplotipos/genética , Homocigoto , Humanos , Masculino , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Análisis de Componente Principal , Población Blanca/genética
17.
Curr Protoc Hum Genet ; 100(1): e80, 2019 01.
Artículo en Inglés | MEDLINE | ID: mdl-30516347

RESUMEN

Electronic health records contain patient-level data collected during and for clinical care. Data within the electronic health record include diagnostic billing codes, procedure codes, vital signs, laboratory test results, clinical imaging, and physician notes. With repeated clinic visits, these data are longitudinal, providing important information on disease development, progression, and response to treatment or intervention strategies. The near universal adoption of electronic health records nationally has the potential to provide population-scale real-world clinical data accessible for biomedical research, including genetic association studies. For this research potential to be realized, high-quality research-grade variables must be extracted from these clinical data warehouses. We describe here common and emerging electronic phenotyping approaches applied to electronic health records, as well as current limitations of both the approaches and the biases associated with these clinically collected data that impact their use in research. © 2018 by John Wiley & Sons, Inc.


Asunto(s)
Minería de Datos , Registros Electrónicos de Salud , Algoritmos , Humanos , Fenotipo
18.
AMIA Annu Symp Proc ; 2019: 572-581, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-32308851

RESUMEN

Electronic health records are an increasingly important resource for understanding the interactions between patient health, environment, and clinical decisions. In this paper we report an empirical study of predictive modeling of seven patient outcomes using three state-of-the-art machine learning methods. Our primary goal is to validate the models by interpreting the importance of predictors in the final models. Central to interpretation is the use of feature importance scores, which vary depending on the underlying methodology. In order to assess feature importance, we compared univariate statistical tests, information-theoretic measures, permutation testing, and normalized coefficients from multivariate logistic regression models. In general we found poor correlation between methods in their assessment of feature importance, even when their performance is comparable and relatively good. However, permutation tests applied to random forest and gradient boosting models showed the most agreement, and the importance scores matched the clinical interpretation most frequently.


Asunto(s)
Registros Electrónicos de Salud , Aprendizaje Automático , Modelos Estadísticos , Evaluación del Resultado de la Atención al Paciente , Femenino , Humanos , Modelos Logísticos , Masculino , Modelos Teóricos
19.
PLoS One ; 14(12): e0226771, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-31891604

RESUMEN

We performed a hypothesis-generating phenome-wide association study (PheWAS) to identify and characterize cross-phenotype associations, where one SNP is associated with two or more phenotypes, between thousands of genetic variants assayed on the Metabochip and hundreds of phenotypes in 5,897 African Americans as part of the Population Architecture using Genomics and Epidemiology (PAGE) I study. The PAGE I study was a National Human Genome Research Institute-funded collaboration of four study sites accessing diverse epidemiologic studies genotyped on the Metabochip, a custom genotyping chip that has dense coverage of regions in the genome previously associated with cardio-metabolic traits and outcomes in mostly European-descent populations. Here we focus on identifying novel phenome-genome relationships, where SNPs are associated with more than one phenotype. To do this, we performed a PheWAS, testing each SNP on the Metabochip for an association with up to 273 phenotypes in the participating PAGE I study sites. We identified 133 putative pleiotropic variants, defined as SNPs associated at an empirically derived p-value threshold of p<0.01 in two or more PAGE study sites for two or more phenotype classes. We further annotated these PheWAS-identified variants using publicly available functional data and local genetic ancestry. Amongst our novel findings is SPARC rs4958487, associated with increased glucose levels and hypertension. SPARC has been implicated in the pathogenesis of diabetes and is also known to have a potential role in fibrosis, a common consequence of multiple conditions including hypertension. The SPARC example and others highlight the potential that PheWAS approaches have in improving our understanding of complex disease architecture by identifying novel relationships between genetic variants and an array of common human phenotypes.


Asunto(s)
Aterosclerosis/genética , Negro o Afroamericano/genética , Pleiotropía Genética , Metagenómica , Fenómica , Anciano , Estudios Epidemiológicos , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple
20.
J Obes ; 2018: 3253096, 2018.
Artículo en Inglés | MEDLINE | ID: mdl-30363675

RESUMEN

The location and type of adipose tissue is an important factor in metabolic syndrome. A database of picture archiving and communication system (PACS) derived abdominal computerized tomography (CT) images from a large health care provider, Geisinger, was used for large-scale research of the relationship of volume of subcutaneous adipose tissue (SAT) and visceral adipose tissue (VAT) with obesity-related diseases and clinical laboratory measures. Using a "greedy snake" algorithm and 2,545 CT images from the Geisinger PACS, we measured levels of VAT, SAT, total adipose tissue (TAT), and adipose ratio volumes. Sex-combined and sex-stratified association testing was done between adipose measures and 1,233 disease diagnoses and 37 clinical laboratory measures. A genome-wide association study (GWAS) for adipose measures was also performed. SAT was strongly associated with obesity and morbid obesity. VAT levels were strongly associated with type 2 diabetes-related diagnoses (p = 1.5 × 10-58), obstructive sleep apnea (p = 7.7 × 10-37), high-density lipoprotein (HDL) levels (p = 1.42 × 10-36), triglyceride levels (p = 1.44 × 10-43), and white blood cell (WBC) counts (p = 7.37 × 10-9). Sex-stratified tests revealed stronger associations among women, indicating the increased influence of VAT on obesity-related disease outcomes particularly among women. The GWAS identified some suggestive associations. This study supports the utility of pursuing future clinical and genetic discoveries with existing imaging data-derived adipose tissue measures deployed at a larger scale.


Asunto(s)
Estudio de Asociación del Genoma Completo , Personal de Salud , Grasa Intraabdominal/diagnóstico por imagen , Síndrome Metabólico/diagnóstico por imagen , Obesidad/diagnóstico por imagen , Tomografía Computarizada por Rayos X , Adiposidad , Adolescente , Adulto , Índice de Masa Corporal , Recolección de Datos , Femenino , Humanos , Grasa Intraabdominal/patología , Masculino , Síndrome Metabólico/genética , Síndrome Metabólico/patología , Persona de Mediana Edad , Obesidad/genética , Obesidad/patología , Factores de Riesgo , Adulto Joven
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...